Overview

§0o|0z|1 \c|1o|1v|2i|2d|31|3[++]9|3 \v|3i|3s|3u|3a|3l|2i|2[-]s|2[-]a|2[-]t|2[-]i|1[-]o|1[-]n|1[-]s|0

The time-series line in the heading above is derived from current incidence cases of COVID-19 in Australia.

Column

Introduction

This web site is a joint effort of researchers at the South Western Sydney Clinical School and the Centre for Big Data Research in Health at the UNSW Faculty of Medicine, the Econometrics and Business Statistics Research Group of Monash University, and at the Ingham Institute for Applied Medical Research in Liverpool, Sydney.

The intent is to offer a range of principled epidemiological and statistical analyses and visualisations of current COVID-19 data which go beyond the now ubiquitous world maps and cumulative incidence charts.

The broad themes for the analyses and visualisations currently available are listed in the menu at the top of this page – more will be added in due course. For each theme there is an introductory page explaining the motivating ideas and methodology employed for each of the visualisations or analyses for that theme, which are available in the subsequent frames (the series of rectangles at the top of each page). Additional notes or commentary appear on the right of some pages.

An Australian focus with an international perspective

This site has been created by researchers at Australian universities, and hence the focus is on the situation in Australia, within the broader international context – we are, after all, all in this together. However, we hope that some of the analyses and visualisations on this site might be useful elsewhere, and to that end, all the R source code used to create this site if freely available – please see the Technical details tab above for details on software used, and where to find the source code.

Contributors

Creative Commons License
The content on this site is licensed under a Creative Commons Attribution 4.0 International License. If you wish to re-use any of the content, please retain the attribution which appears on each chart or set of charts, or otherwise provide that attribution alongside the content that you re-use.

In turn, we acknowledge RECON for providing financial support, and European Centre for Disease Control who have provided up to date data on the COVID19 pandemic.

Project has been led by Timothy Churches, Nicholas Tierney, with thanks to Stuart Lee, Dianne Cook, Miles McBain, and Rob Hyndman

Visualisations and analysis made available in the covidrecon package

Source code for this visualisation are available at covid-flexdashboard

Technical Details

Data Analysis has been entirely created within the R programming language using Rstudio.

Packages used include:

Incidence

Explanation

Case Incidence

This presents the daily number of cumulative cases, and daily cases of COVID19 for selected countries around the world. The data is drawn from the European CDC, which has collected data from various governing agencies around the world. There are some small discrepancies between this data and that given by the official Australian government results, but in the absence of up to date easily obtainable data from Australia, this is the highest quality data that we have.

Cumulative Incidence for selected countries


This graphic shows the cumulative cases of COVID19 for selected countries, against the number of days since a country exceeded 100 cumulative cases.

Note that Australia the curve is starting to flatten, as we have had fewer cases per day.

Although Australia does not have the same level of control over cases as countries like Japan and Singapore, it seems that the measures the Australian government have taken to limit travelers, and enforce physical distancing are working.

Daily Global Incidence


The number of cases per day recorded in Australia, as provided by the European CDC.

We note that there are some days where the number of cases appears to spike upwards, followed by a decrease the following day. This indicates that there may be some data discrepancies in how the European CDC is capturing data from WHO Situation Reports. It underlines the importance of nations providing reliable machine-readable access to their own COVID-19 data. By “machine-readable” we mean CSV or JSON data files which are automatically downlaodable, or an API which can be queried automatically to yield such data. Neither of those are difficult to establish, yet nearly all national governments have failed to provide such data, leaving it to third-party agencies abd citizen-science efforts to piece together the required data in a manner that permits ongoing analysis. There is, for example, no official machine-readable source of national COVID-19 data provided by the Australian government. NSW Health is the only State or Territory government that has made any effort in that direction by providing some machine-readable data, which we leverage in the \(R_{t}\) for NSW theme (see menu above).

Daily Global Incidence (fixed y-axis scale)


Forcing the y axis scales to be the same for all plots means that, compared to the previous plot where the y axis could change for each country, the country with the largest number cases, in this case, the USA, appears the same, and the rest of the plots appear smaller.

This give us an important context of the number of cases relative to each country. The bottom row of countries, Australia, Japan, Singapore, and New Zealand barely register.

National-level \(R_{t}\)

Explanation

To understand the spread of COVID19, we use a statistic called “R0” (R-nought). This is the number of people we expect to be infected from one COVID19 infected person arriving in a population.

We are estimating R0 from the available data, and are specifically estimating the effective reproduction number (different to basic reproduction number). This reflects the current state of a population, which may include some infections.

What does R0 mean?

If R0 is 1

  • One person arriving to a population could spread it to one other person. Those people could then spread it to one other person. This is easier to manage.

If R0 is 0.3

  • For every 10 people infected, it would spread to 3 other people.

If R0 is 2

  • Every person arriving in the population can infect 2 more people. This can quickly get out of control.

What do we want R0 to be?

  • We want the R0 to be below 1, and ideally as close to 0 as possible.

Estimating Effective reproduction number

We estimate the effective reproduction number using mathematical modelling. This means that the estimated numbers are dependent upon the mathematical model, and while they are our best possible guess at R0, they may be some inaccuracies.

These estimates reflect the number of new cases we expect from an infected person as they interact with a population. The higher the number, the worse the control of the infection, and the more people will be infected. We want this number to be below 1, and preferably as close to 0 as possible.

Misconceptions / mistakes

R0 is not a rate, and has no units of time.

This means it is not a rate of infection, and does not tell us how fast an infection spreads.

Limitations

These estimates are for all countries do not account for people who bring the infection to the population by travelling. This means that we are assuming all of these infections are spread by the community. However, Australia actually has quite a large proportion of people who are travelling from overseas with COVID19. This means that these estimates might be over-estimating R0. We have explored what is possible if we improve incorporate these imported cases, where we have this data for New South Wales.

Selected Countries


The effective R estimates for each country indicate that most countries are bringing the spread of the virus under control. However there is some wide variation. The measurement of effective R for Australia has decreased substantially, now falling below one, indicating that we have now started to control the outbreak.

Per Country


This is the same information presented as in the previous graphic, but with each country split into its own graph. This allows us to see the trajectory of effective R.

Most countries are decreasing, but we notice that Japan and Singapore are fluctuating, due to recent outbreaks.

Per Country (ribbons)


This shows the uncertainty around the estimate of the effective R. We notice that as time goes on, there is less uncertainty around the estimate, so we can more confident in our estimate at these later times.

Australia


Japan


Cool!

South Korea


Cool!

China


Cool!

Germany


Cool!

Spain


Cool!

France


Cool!

Italy


Cool!

Singapore


Cool!

United Kingdom


Cool!

USA


Cool!

\(R_{t}\) for NSW

Explanation

The total number of incident cases arising at timestep \(t\), \(I_t\), is the sum of the numbers of incident local (\(I_{t}^{local}\)) and imported (\(I_{t}^{imported}\)) cases,

\[ I_t = I_t^{local} + I_t^{imported} \]

It is assumed that, if imported cases exist, they can be distinguished from local cases, for instance through epidemiological investigations, so that \(I_{t}^{local}\) and \(I_{t}^{imported}\) are observed at each timestep.

\[\Lambda_t(w_s) = \sum_{s=1}^t (I_{t-s}^{local} + I_{t-s}^{imported}) w_s = \sum_{s=1}^t I_{t-s} w_s \] \[ \mathbb E(I_t^{local} | I_0, I_1, \ldots, I_{t-1}, w_s, T_t) = R_t\Lambda_t (w_s)\]

NSW – locally-acquired & overseas-acquired cases treated separately
(cases under investigation excluded)
parametric serial interval distribution


Commentary

NSW – locally-acquired & overseas-acquired cases treated separately
(cases under investigation excluded)
serial interval distribution estimated from data


Commentary


NSW – adjusting for potential under-ascertainment, locally-acquired & overseas-acquired cases treated separately, (cases under investigation excluded), parametric serial interval distribution


In these plots, the counts of incident cases with presumed local sources of infection have been inflated by a factor of 10, and the counts of cases with presumed overseas sources of infection inflated by a factor of 1.5. This mimics ten-fold under-ascertainment of locally-transmitted cases, and 50% under-ascertainment of inbound cases.


NSW – adjusting for potential under-ascertainment, locally-acquired & overseas-acquired cases treated separately, (cases under investigation excluded), serial interval distribution estimated from data


In these plots, the counts of incident cases with presumed local sources of infection have been inflated by a factor of 10, and the counts of cases with presumed overseas sources of infection inflated by a factor of 1.5. This mimics ten-fold under-ascertainment of locally-transmitted cases, and 50% under-ascertainment of inbound cases.


NSW – locally-acquired & overseas-acquired cases treated separately
(cases under investigation included)
parametric serial interval distribution


Commentary


NSW – locally-acquired & overseas-acquired cases treated separately
(cases under investigation included)
serial interval distribution estimated from data


Commentary


NSW – all cases treated as locally-acquired
parametric serial interval distribution


From data commentary blah blah blah.


NSW – all cases treated as locally-acquired
serial interval distribution estimated from data


From data commentary blah blah blah.